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Abstract 



>0 , We study a type of calculus for proving inequalities between subgraph densities which is based 

on Jensen's inequality for the logarithmic function. As a demonstration of the method we verify 
the conjecture of Erdos-Simonovits and Sidorenko for new families of graphs. In particular we give 
(-H [ a short analytic proof for a result by Conlon, Fox and Sudakov. Using this, we prove the forcing 

C^ , conjecture for bipartite graphs in which one vertex is complete to the other side. 



> : 1 Introduction 

m 
in 

Inequalities between subgraph densities is subject of extensive study. Many problems in extremal 
graph theory can be formulated in this language. Subgraph densities can be conveniently defined 

^D [ through graph homomorphisms. A graph homomorphism between finite graphs i7 = {V{H),E{H)) 

and G = {V{G), E(G)) is a map (f) : V{H) -^ V{G) such that the image of every edge in H is an 
edge in G. The subgraph density t{H, G) is the probability that a random map : V{H) — > V{G) 

^% ' is a graph homomorphism. 

5^ , In the frame of the graph limit theory 16J G can be replaced by an analytic object which is a 

two variable symmetric measurable function. Many statements in graph theory can be equivalently 
stated in this analytic language as follows. Let (il, ji) be a probabihty space and VF : fi^ — > M be a 
bounded measurable function such that W{x^ y) = W{y, x) for every pair x, y G O. If the range of 
W is in the interval [0, 1] then it is called a graphon. Let 

t{H,W)=¥.{ H W{x,,x,)) (1) 

(x,,Xj)eE{H) 

where V{H) = {-Ti}"^! are independently chosen from Q,. It is easy to see that if fi = V{G) 
with the uniform distribution and Wg ■ V{G) x V{G) — !> {0, 1} is the adjacency matrix of G then 
i(iJ, G) = t{H, Wg). 



The Cauchy-Schwartz inequality is a fundamental tool in establishing inequalities between sub- 
graph densities. It was shown in ||6l that every true inequality between subgraph densities is a 
consequence of a possibly infinite number of Cauchy-Schwartz inequalities. As a negative result 
Hatami and Norin lH proved the undecidability of the general problem. 

Motivated by earlier work on entropy calculations ||5] in this area, we develop a method using 
Jensen's inequality for logarithmic functions. In particular we use the concavity of 2 h-). In z and 
the convexity of z t-^ z\nz. Similarly to the Cauchy-Schwartz calculus, the logarithmic calculus 
is capable of proving inequalities through a line of symbolic calculations without verbal arguments. 
This can make it suitable for computer based algorithms designed to find new inequalities. 

We demonstrate the power of the logarithmic calculus approach on a fundamental problem 
known as Sidorenko's conjecture (also conjectured by Erdos and Simonovits [iTO|). In the com- 
binatorial language the conjecture says that following. 

Conjecture 1 Let H be a bipartite graph and G be an arbitrary graph. Then t{H, G) > t{P2, GY 
where P2 is the single edge and e is the number of edges in H. 

Phrased originally by Sidorenko in the analytic language ||2l, the conjecture says that t{H, W) > 
t{P2, Wy holds for every bounded non-negative measurable function W. Expressions of the form 
t{H, W) appear as Mayer integrals in statistical mechanics, Feynman integrals in quantum field 
theory and multicenter integrals in quantum chemistry. The conjecture is also related to many topics 
such as Markov chains H], matrix theory and quasi-randomness. Note that in the analytic form 
if W is constant then t{H, W) = t{P2, WY ■ The forcing conjecture rephrased in this analytic 
language says that if H is not a tree then this is the only case when equality holds. Note that this is 
a refinement of Sidorenko's conjecture. 

Sidorenko's conjecture has been verified for various families of graphs, including even cycles, 
trees IJS], and hypercubes [S). Most recently the conjecture has been proven by Sudakov, Conlon 
and Fox IS] for bipartite graphs with one vertex complete to the other side. The proof uses the 
tensor power trick and a certain probabilistic method called dependent random choice. As a quick 
demonstration of the logorithmic calculus we give an analytic proof of this result. Essentially the 
same proof also implies the forcing conjecture for the same graphs. Note that the forcing conjecture 
for bipartite graphs in which two vertices are complete to the other side was proved by Conlon, 
Sudakov and Fox JS). 

The logarithmic calculus also yields stronger intermediate inequalities that are preserved under 
'gluing operations' and thus provide an iterative method for proving Sidorenko's conjecture for 
many new graphs. A theorem of this type is the following. We call a graph a reflection tree (for 
precise definition see chapter |3]l if it can be obtained from a tree by gluing reflected versions of 
subtrees on it. They include the bipartite graphs considered by Conlon, Fox and Sudakov. Let us 



call a bipartite graph H a Sidorenko graph if t{H, W) > t{P2, WY for every graphon W . 

Theorem 1 Reflection trees are Sidorenko. 

Another theorem that we prove is the following. 

Theorem 2 (Edge gluing) Let Hi,H2 be Sidorenko graphs and ei e E{Hi),e2 & E{H2) be 
arbitrary edges. Then the graph H obtained frorn Hi and H2 by identifying ei and 62 is also a 
Sidorenko graph. 

Note that in the above theorem the edges ei and 62 can be identified in two different ways. 

2 Basics of Logarithmic Calculus 

Lemma 2.1 (Jensen's inequality) Let (r2, fi) be a probability space, let c be a convex (resp. con- 
cave) function on an interval Z? C M and g : fl ^ D be a measurable function. Then 

IE(c(g)) > c(E(g)) {convex) , E{c{g)) < c{E{g)) (concave) (2) 

Moreover ;/£(/) = 1/or sorne non-negative function f on il then also 

]E(/c(.9)) > c(E(/,9)) (convex) , E{fc{g)) < c{E(fg)) (concave) (3) 

Ifc is a strictly convex (concave) function then equality in Q} is only possible if g is constant on the 
support of f. 

Proof. Jensen's inequaUty is a classical result whose proof is based on the intuitively clear fact that 
if a probability measure is concentrated on a convex (resp. concave) curve then the center of mass 
is above (resp. below) the curve. The inequality (O is a direct consequence of (|2]i if we consider / 
as the density function of a new measure /i* on il. 

We introduce some notation. Let g{xi,X2, . ■ . , x„) be a function on [0, 1]". For a subset S C 
{xi, a;2, . . . ,Xn} of the variables the we introduce the |5|-variable function 

Es(g)= j g n '^^- 

Notice that E(g) = E(E5(g)). Tosimplify notation we will identify vertices of graphs with variables 
representing them in the formula ([T]i- As the first illustration of the logarithmic calculus we start by 
the Blakley-Roy inequaUty. 

Proposition 2.1 (Blakley-Roy) Let W : [0, 1]^ -^ M+ be a bounded symmetric measurable func- 
tion and d = E(W). Then t(Pn, W) > d". 



Proof. Let d{x) = E^(P^(x, y)) and 

n n 

/ = (n w{x,,x,+^)) d-' (j[ d{x,)-') . 

i=l i=2 

We have that E(/) = 1. 

n n n 

\nt{Pn,W) =lnE(/dJ|d(a;,)) > Ind + E(/ lnj| d(a;,)) = hid + ^E(/ lnd(a-,)) == 

i=2 i=2 8=2 

n 71 

= lnd + ^E(E^,(/lnd(.T,;))) = hid + ^d"iE(d(x,) lnd(x,)) > nlnd. 

4=2 1=2 

The first inequality follows from ^ with 0(2;) = In z and the second inequality follows from (|2]l 
with c{z) = z \n z. 

Theorem 3 (Conlon-Fox-Sudakov) Let H be the {bipartite) graph on the vertex set {x, yi,y2, ■ ■ ■ ■, ym, vi,V2t ■ ■ ■ ,Vk} 
such that X is connected to vi,V2t ■ ■ ,Vk and yt is connected to the vertices St C {wi, 'y2, . • • , Wfc} 
where \St\ = at- Let e = fc + X^t^i '^t ^^ ^''^ total number of edges in H. Then ifW : [0, 1]^ — S> M"*" 
is a measurable function and d ~ E(VF). Then t{H, W) > d'^ . 

Proof. Let 

k 
q^^,{W{x,z)) and ! = d-^q^-''\{W{x,v,), 

1=1 

St-Es,( n ^(^'«j)) and ft ^ si\''^-''\{w{x,v,) (i == 1, 2, . . . ,to). 

Vj£St i=l 

Notice that E(/) = E(/t) = 1. Using (|3]l with c{z) —\nz and (l2|i with c(z) = z In z we have 



lnt(if, W^) = lnE(/9'=-id[].s,) > EahiCg^^-idJJs,)) 

j=i i=i 

= (fc - l)E(E,(/lng)) +E(/lnd) +^E(/lnsO = 



i=l 



= (fc- l)d~^E(qhi<7)+lnd + ^E(/lnSj) > fclnd + ^ E(/ his,). 

i=l 4=1 

Let /it = 54(7!-"*. ThenbyE^(/t/it) =q,E{ftht) == d and Q,© with c(z) = zlnz, 
E(/lnst) = d-'Eiftht\nht) + (at - l)d-^E{E,{ftht\nq)) = 
= d-'^ (E{ftht\nht) + {at -l)E{q\nq)) > atlnd. 

Remark: In the proof of theorem [3] it is convenient to assume that W is strictly positive. Then 
using the continuity of t{H, W) under L^ convergence we obtain the Sidorenko inequality for every 



non-negative W. However it is a useful observation that with some extra caution all the steps of 
the proof work for non-negative functions. The reason for this is that if we replace all the weight 
function / and ft occurring in the proof by modified versions /' and fj. in which values that are 
not defined are replaced by then the same calculations remain true. This is a simple case by case 
checking. 

Theorem 4 (Forcing for C-F-S graphs) The forcing conjecture holds for bipartite graphs in which 
one vertex is complete to the other side (and are not trees). 

Proof. We use the notation from the proof of theorem|3]. In this proof we allow W to take the value 
0. Let /' and f[ be as in the remark following the proof of Theorem|3] Assume that t{H, W) = dP. 
Then in the proof of theorem|3]all the inequalities become equalities. We immediately obtain that 
E((7 In q) = d Ind and thus q has to be constant d on [0, 1]. If H is not a tree then there is a number 
1 < .? < fc such that Oj > 2. We obtain that E(/j/ij In hj) = d In d and thus hj has to be constant d 
on the support of /'. This implies that Sj is constant d"* on the support of /' which is equal to the 
support of Y[i=i W{x, Vi). This implies that if the value of Sj is not then it is equal to (i°*. On 
the other hand (since spiders are Sidorenko) we have E{st) > d"-* and thus Sj is constant d°-* . This 
implies that t(i^2,at7 W) = IE(.Sj ) ~ d^"*. Using that /\2.at is forcing the proof is complete. 

3 Smoothness and Gluing 

In general, we consider the logarithmic calculus as a symbolic way of proving inequalities between 
subgraph densities using conditional expectations and Jensen's inequality for z In z and In z. In this 
chapter we give a demonstration by proving Sidorenko's conjecture for a family of bipartite graphs. 
However this is not the limitation of the method inside Sidorenko's conjecture and there are many 
applications outside Sidorenko's conjecture. Further applications will be discussed in a subsequent 
paper 

The proof of theorem[3]relies on the interesting phenomenon that certain bipartite graphs satisfy 
stronger inequaUties than Sidorenko's. Note that the form of these new types of inequalities hints 
at useful reinterpretation as geometric averages (detailed later). More specifically we showed that 
if H is the graph on the vertex set {x, vi, ..., Vk,y}, where x is connected to vi, . . . ,Vk and y is 
connected to a subset 5* of {wi, W2, ■ ■ ■ ,Vk} then it satisfies the inequality 

n 

E(d-^q'''-^Y[W{x,v,)\ns\ >|S'|lnd (4) 

1=1 

where s = Esflli) gs ^^2^' ^i)) and d and q are as in the proof of the theorem. This inequality 
^ allows one to glue together such graphs H with various choices of S along the star spanned 



on {x, vi,V2, ■ ■ ■ ,Vk} and the resulting graph will still be Sidorenko. Through this operation we 
can build the type of graphs considered by Sudakov, Conlon and Fox. This indicates that ^ type 
inequalities can be used to produce new Sidorenko graphs by gluing. Our goal is to generalize this 
situation. For ease of expression, we first introduce restricted subgraph densities. 

Let C/„ denotethesetof graphs in which 71 different vertices are labeled by the numbers {1,2, ... ,n}. 
If Hi and H2 are in Qn then their product H1H2 is defined as the graph obtained form them by 
identifying vertices with the same label and then reducing multiple edges. The notion of subgraph 
density can be naturally extended for graphs in Qn. 

Definition 3.1 (Restricted subgraph density) Let H e Qnbea graph on the vertex set {xi,X2,. .. ,Xm} 
such that the labeled vertices are S ~ {xi ,X2, ■ . ■ ,Xn}. Then the restricted subgraph density of H 
in W : [0, 1]^ — > M is the n-variable function defined as 

ts{H,W)=Es{ n Wix„x,)). (5) 

ix,,Xj)eE{H} 

Lemma 3.1 Let T be a tree on the vertex set {xi ,X2, . ■ . ,Xn} and let 

n 

.fT = d~^l[d{x,)^-''' n W{x,,xj). (6) 

1=1 (xi,Xj)eE{T) 

Then for an arbitrary 1 < i < n we have thatKxiifr) = d{xi)/d and consequently Klfx) ~ 1. 

Proof. Ifn = 1 then the statement is trivial. By induction assume that it holds for ri — l. Assume that 
n > 1 and Xj y^ 2;^ is a leaf in T. Let S = {xi, . . . ,a;„} \ {xj}. Then Exiifr) = ExiiEsifr)) = 
Ea; . {fx' ) where T' is obtained from T by deleting Xj and thus the induction step finishes the proof. 
We are now ready to define the notion of smoothness. 

Definition 3.2 (Smoothness) Let H £ Qn be a bipartite graph on the vertex set {xi, X2, ■ . ■ , Xm} 
such that the spanned subgraph on S = {xi, X2, . ■ . ,Xn} is a tree T. We say that H is smooth (or 
T is smooth in H) if 

Ef/T lnts{H*,W)) > \E{H*)\\nd 



where H* is the graph obtained from H by deleting the edges in T. 

Note that if n = then T is empty. In this case the smoothness of H is equivalent with the 
Sidorenko property. The next two lemmas together show that smoothness in general, is a strength- 
ening of the Sidorenko property. 

Lemma 3.2 (Unlabeling) Let H G Qn be a smooth bipartite graph with tree T spanned on the 
labeled vertices S and let T' be a non-empty sub-tree spanned on S' C S. Then the graph H2 
obtained frorn H by unlabeling the vertices in S \ S' is also smooth. 



Proof. It is enough to prove that unlabehng one leaf in H preserves smoothness. Every other case 
can be obtained by iterating this step. Assume that a;„ is a leaf connected to X„_i. We have that 

e(/t' lnis,(i7*,VK)) =e(/t' \n¥.,SW{xn,Xn-i)ts{H\W))) = 

= e(/t' \n¥.^SW{xn,Xn~i)d{xn-i)-^d{xn-i)ts{H\W))) > 
E(/t' lnd(a;„_i))+E(/T \nts{H* ,W)) >E{fT' \nd{xn-i)) + \E{H*)\hid. 
In the above calculation we use the concavity of z — > hi(z) with weight function W{xn,Xn-i)d{xn-i)~^- 



Using lemma QT] we get 

E(/t' \nd{xn^i)) = E(E,„_,(/t' lnd(a:„_i))) = d-^E{d{xn-i)\nd{xn-i)) > hid. 

Lemma 3.3 Assume that (xi, X2) G E{H) and H £ Q2 is smooth. Then H is a Sidorenko graph. 
Proof. We have that 

lnE(t(iJ,W^)) = \nd + \nE{W{xi, X2)d-^t{H* ,W)) > 
>hid + E{W{xi,X2)d'^ \nt{H* ,W)) >hid+ \E{H*)\lnd = \E{H)\lnd. 



Lemma I3T2I and lemma l3T3l together imply the following important corollary. 

Corollary 3.1 Let H G Gn be a smooth graph. Then H is Sidorenko. 

The next lemma shows how to produce now Sidorenko graphs from old ones using smoothness. 

Lemma 3.4 (Gluing on smooth trees) Let Hi , H2 G Qn be two smooth graphs such that the trees 
spanned on the labeled vertices are identical with T in both graphs. Then H = H1H2 G Qn is also 
smooth. 

Proof. 

E{fT\nts{H\W)) ^E{fT\nts{Hl,W)ts{H*2,W)) ^E{fT\nts{Hl,W))+E{fT\nts{H*2,W)) 
> i\E{Hi)\ -n + l)\nd+{\E{H2)\ -n + l)hid= {\E{HiH2)\ -n + l)\nd. 



Lemma 3.5 (Extension of smooth part) Let H G Qn, {n> 1) be a smooth graph and let T' G Qn 
be a tree such that H and T' induce the same tree T on the labeled points. Then the graph H2 
obtained from HT' by putting labels on all the vertices in T' is again smooth. 

7 



Proof. It is enough to prove the statement for the case where y = V{T') \ V{T) is a single vertex 
which is a leaf in T' . The general case is an iteration of this step. Without loss of generality assume 
that y is connected to x„. Let S = {xi, X2, . . . , x„}, S" = S" U {y} and s — ts'{H2, W). In this 
case 

E(/t' Ins) = E{Es{.fT' Ins)) = E(/t Ins) = E(/t In ts(i/*, M^)) > 

>\H*\\nd^ \H;\lnd. 



Definition 3.3 (Reflection) Reflection of a subgraph H2 induced on K G V{H) in H along an 
independent set S <Z K is the operation which produces the graph HH2 where the vertices in K 
are labeled in both H and H2 in the same way. 

Lemma 3.6 Let T G Q^ be a tree such that the labeled points S are independent. Let H be the 
graph obtained from T^ by labeling all points in one copy ofT. Then H is smooth. 

Proof. The statement is obviously equivalent with the following one. Let T be a tree on M = 

{xi,a;2, . . . , Xm,} and let S = {xi,a;2, . . . , a;„} C 1\I. Then 

E{fThits{T,W))>\E{T)\\nd. 
Let s = ts{T, W) and q = llT=i dixiY^'^. Then 

E{fTlns) ^ d-^E{tMiT,W)s-'^ {sq~'^)ln{sq-'^)) + d-^E{tM{T,W)s-hq-Hiiq). 
> lnd + ^d-\n - l)E{Ea,,itMiT,W)s-'^sq-^lnd{xi))) = 

m m 

= ln(i + ^d~i(r, - l)E{d{x i) In d{xi)) > lnrf + ^(ri -l)lnd= (m-l)lnd. 



To illustrate lemma |3T6| on an example, let T be the path of length m such that the two endpoints 
are labeled. Then T^ is the cycle C2m of length 2m. The lemma implies that a path of length m is 
smooth inside C2m- Then lemma [32] shows that any path of length at most m is also smooth inside 
C2m and thus in particular edges are smooth. It follows from Lemma [34] that if we glue together 
even cycles along an edge then the resulting graph is Sidorenko. We will see later that this is true 
with arbitrary Sidorenko graphs. 

Now we introduce a class of graphs and we call them reflection trees. They include trees, even 
cycles, and bipartite graphs in which one vertex is complete to the other side. We prove that reflec- 
tion trees are Sidorenko. 

Definition 3.4 (Reflection tree) A reflection tree is a graph obtained from a tree T by applying the 
reflection operation to a collection of sub-trees in T. 

8 



Two examples for reflection trees are even cycles and bipartite graphs in which one vertex is 
complete to the other side. Even cycles are obtained by reflecting a path and the other example 
is obtained from a star reflecting sub-stars. Now we are ready to prove that reflection trees are 
Sidorenko. 

Proof of Theorem\J] Lemma l376l and lemma |33] together show that in any graph obtained from a 
tree T by reflecting a subtree the tree T is smooth. Then lemma U4l finishes the proof. 



4 Smoothness of edges 

To avoid complications, our original definition for smoothness used functions W that are strictly 
positive. However smoothness can be equivalently defined in purely graph theoretic terms. Us- 
ing the notation from the previous chapter let H G Qn be a bipartite graph on the vertex set 
{si, a;2, . . . , Xm} such that the spanned subgraph on 5 = {xi, 0:2, . . . , a;„} is a tree T. Let G 
be a finite graph (replacing W). Homomorphisms from H (resp. T) to G can be looked at as as- 
signments of values from V{G) to the variables {a;i, X2, . . . , Xm} (resp {xi, X2, . . . , a;„}). Let fir 
denote the probability distribution on V{G)" defined by the density function /y. The places where 
/t is not defined are set to 0. Intuitively, the distribution ^t is a random copy of T built up in G by 
first choosing a random edge e and then growing T by adding leaves to the existing configuration 
one by one in a random way. This can also be considered a branching random walk with structure 
T started from a random edge. The inequality defining smoothness becomes 

E{lnts{H*,G))>\E{H*)\\nd. (7) 

where the expected value is taken with respect to the distribution fiT- It can be seen from standard 
methods in graph limit theory that the two definitions are equivalent. From Q we can immediately 
see that for every (homomorphic) copy of T in G the restricted homomorphism density ts {H* , G) 
can't be 0. This gives an interesting topological obstruction to smoothness. 

Definition 4.1 (Retract) Let H be a graph and S C V{H) be a subset of its vertices. We say that 
S (Or the graph H2 spanned on S) is a retract of H if there is a graph homomorphism (j) : H ^f H2 
such that (j) restricted to H2 is the identity map. 

It is easy to see that H2 is a retract if and only if any graph homomorphism 0' : H2 — > G into 
an arbitrary graph G extends to a graph homomorphism (p : H ^ G. The next lemma follows 
immediately from O. 

Lemma 4.1 If a tree T spanned on the vertex set S C V{H) in a graph H is smooth then T is a 
retract of H. 



The following natural conjecture arises. 
Conjecture 2 If H is a Sidorenko graph and a tree T is a retract of H then T is smooth in H. 

We prove the following special case of the above conjecture. 
Theorem 5 IfH is a Sidorenko graph then every edge is smooth in H. 

Proof. Let G be a finite graph with edge density d. We denote by Gk the fc-th tensor power of G. 
Assume that e is a fixed edge in H and that H has a + 1 edges. Let H* he the graph obtained from 
H by removing the edge e and labehng the two endpoints by {1,2}. Let Ek denote the set of edges 
/ in Gk for which tf{H*,Gk) > d''" /2. 

Let ilk be the probability space of a randomly chosen edge in Gk and let Xk be the random 
variable tf{H* , Gk) on flk- It is clear from the definition of Gk that the distribution of Xk is the 
product of k independent copies of Xi. Let e > be an arbitrary number By the law of large 
numbers, if k is big enough then 

P(|(lnXfe)/fc - E(lnXi)| > e) < e. 

Now we estimate the probability P(-Efc) = l-^fe|/l-E'(G'fe)| in the probability space ilk- Let 
G'f, denote the graph obtained from Gk by deleting the edge set Ek ■ The edge density of G'j. is 
equal to d''{l - V{Ek)) and so using the fact that H is Sidorenko it follows that t{H,G'^) > 
<i(°+i)'^(l - P(Sfc))(°+i). On the other hand tf{H\G'^) < d^" /2 for every edge / in G'^ and 
thus t{H,G'^) < d''{l - F{Ek))d'"'/2. We obtain that 1/2 > (1 - F{Ek))'' and thus ¥{Ek) > 
1 - 2-° > 0. 

Notice that the lower bound for F{Ek) does not depend on k. It follows that if e < 1 — 2^° and 
k is sufficiently big then the event Ek intersect the event | (In Xk ) / k — E,{ln Xi)\ < e and thus 

E(lnXi) > ln(rf'="/2)/A: - e 

holds for all such choices of e and k. Consequently E(ln Xi) > a In d. This inequality is equivalent 
with the smoothness of e. 

Proof of theorem^ The statement is a direct consequence of theorem|5]and lemma lsT?! 
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